VocabAnalyzer: A Referred Word List Analyzing Tool with Keyword, Concordancing and N-gram Functions

Authors

  • Siaw-Fong Chung
  • F. Y. August Chao
  • Yi-Chen Hsieh
Abstract

This paper introduces the newly created VocabAnalyzer, which is equipped with keyword, concordancing and n-gram functions. The VocabAnalyzer also allows the inputted text to be compared against the Jeng et al. (2002) vocabulary word list. Two case studies are discussed in this paper. The first study compares two versions of the English Bible, and the second compares word lists created from abstracts written by graduate students of various English departments in Taiwan. Both studies show that the VocabAnalyzer is beneficial for applied linguistics studies as well as for the teaching and learning of English. Analyses of student writing can be carried out based on the results from this tool.
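As a rough illustration of two of the functions named in the abstract (n-gram counting and concordancing) and of checking a text against a reference word list, the following Python sketch may help. It is not the VocabAnalyzer implementation: the function names, the tokenization rule, and the tiny word list are all assumptions made for this example, and keyword analysis (typically a statistical comparison against a reference corpus) is left out.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (assumed rule)."""
    return re.findall(r"[a-z']+", text.lower())


def ngrams(tokens, n):
    """Count every run of n consecutive tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def concordance(tokens, keyword, width=3):
    """Return keyword-in-context lines with up to `width` tokens per side."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == keyword:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append(f"{left} [{tok}] {right}".strip())
    return lines


def coverage(tokens, word_list):
    """Split the distinct words of a text into those on and off a word list."""
    types = set(tokens)
    return types & word_list, types - word_list


if __name__ == "__main__":
    sample = "In the beginning God created the heaven and the earth."
    toks = tokenize(sample)
    print(ngrams(toks, 2).most_common(3))
    print(concordance(toks, "god", width=2))
    # Hypothetical reference list; a real one would be loaded from a file.
    on_list, off_list = coverage(toks, {"the", "and", "in"})
    print(sorted(on_list), sorted(off_list))
```

Comparing two texts, as in the case studies, would amount to running these functions on each text and contrasting the resulting counts and word sets.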

Similar resources

A Comparison of Word- and Term-based Methods for Automatic Web Site Summarization

Automatic Web site summarization is an effective means of making the content of a web site easily accessible to Web users. We demonstrate that a content-based approach to summarization, which is based on keyword and key sentence extraction from narrative text, is able to generate summaries that are as informative as human authored summaries. This work is directed towards summary generation base...

VARD 2: A tool for dealing with spelling variation in historical corpora

When applying corpus linguistic techniques to historical corpora, the corpus researcher should be cautious about the results obtained. Corpus annotation techniques such as part of speech tagging, trained for modern languages, are particularly vulnerable to inaccuracy due to vocabulary and grammatical shifts in language over time. Basic corpus retrieval techniques such as frequency profiling and...

A Two-pass Strategy for Handling OOVs in a Large Vocabulary Recognition Task

This paper addresses the issue of large-vocabulary recognition in a specific word class. We propose a two-pass strategy in which only major cities are explicitly represented in the first stage lexicon. An unknown word model encoded as a phone loop is used to detect OOV city names (referred to as rare city names). After which SpeM, a tool that can extract words and word-initial cohorts from phon...

Increasing Interoperability for Embedding Corpus Annotation Pipelines in Wmatrix and other corpus retrieval tools

Computational tools and methods employed in corpus linguistics are split into three main types: compilation, annotation and retrieval. These mirror and support the usual corpus linguistics methodology of corpus collection, manual and/or automatic tagging, followed by query and analysis. Typically, corpus software to support retrieval implements some or all of the five major methods in corpus li...

Publication date: 2009